OCR Xpress for Java makes it very easy to create a full page OCR application. For the basics, there are two ways to produce searchable text documents from images:
In addition, text in memory can be generated for the OCR Xpress internal data.
Copy Code
|
|
---|---|
//Read in the image from inputImagePath BufferedImage bufferedImg = null; try { bufferedImg = ImageIO.read(new File(inputImagePath)); } catch (IOException e) { e.printStackTrace(); return; } |
Copy Code
|
|
---|---|
RecognitionParameters parameters = new RecognitionParameters(); parameters.setLanguage(Language.ENGLISH); OcrXpress ocrx = new OcrXpress(); |
Copy Code
|
|
---|---|
ocrx.recognizeToFile(parameters, bufferedImg, FileFormat.PDF, FileMode.OVERWRITE, “PdfFileName.pdf”); |
Copy Code
|
|
---|---|
ocrx.recognizeToFile(parameters, bufferedImg, FileFormat.TEXT, FileMode.OVERWRITE, “TextFileName.txt”); |
Copy Code
|
|
---|---|
Document document = ocrx.recognizeToMemory(parameters, bufferedImg); |